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The augmented GARCH model is a unification of numerous extensions of the popular and widely 
used ARCH process. It was introduced by Duan and besides ordinary (linear) GARCH processes, 
it contains exponential GARCH, power GARCH, threshold GARCH, asymmetric GARCH, etc. 
In this paper, we study the probabilistic structure of augmented GARCH(1, 1) sequences and the 
asymptotic distribution of various functionals of the process occurring in problems of statistical 
inference. Instead of using the Markov structure of the model and implied mixing properties, 
we utilize independence properties of perturbed GARCH sequences to directly reduce their 
asymptotic behavior to the case of independent random variables. This method applies for a 
very large class of functionals and eliminates the fairly restrictive moment and smoothness 
conditions assumed in the earlier theory. In particular, we derive functional CLTs for powers 
of the augmented GARCH variables, derive the error rate in the CLT and obtain asymptotic 
results for their empirical processes under nearly optimal conditions. 

Keywords: Berry-Esseen bounds; empirical process; GARCH; strong approximation; weak 
invariance principles 

1. Introduction 

The seminal work of Engle [17] gave a new impact to the theory of time series analy- 
sis. Engle introduced the ARCH (autoregressive conditionally heteroscedastic) process, 
which allows the conditional variance of the time series to change as a function of past 
observations. In past decades, this model has been widely used in the econometrics litera- 
ture to describe financial data showing time- varying volatility. Starting with the GARCH 
model of Bollerslev [8], a great variety of generalizations and extensions of the ARCH 
model have been introduced and studied. Duan [16] defined a very general model, the 
so-called augmented GARCH process, which contains most of the known GARCH models 
as special cases. 

For some random variable Z , we define D(Z) as the smallest (finite or infinite) interval 
such that Z e D(Z) a.s. 
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Definition 1. Let g(x) and c(x) be real-valued and measurable functions and assume 
that 

{£k, k£Z} is an i.i.d. sequence. (1) 
Assume, further, that the stochastic recurrence equation 

X k = c{£ k -x)X k _ 1 +g{Ek-x), keZ, (2) 
has a strictly stationary solution and assume that 

A : R + — > D(Xq) is an invertible function. (3) 
Then, an augmented GARCH(1, 1) process {yk, — oo < k < oo} is defined by the equation 

yk = o-k£k, (4) 

with a\ = A- 1 (X k ). 

Necessary and sufficient conditions for the existence of a strictly stationary solution of 
(2) are given in Aue et al. [1] and Duan [16]. In fact, (2) defines a stochastic recurrence 
equation of the type treated in [9] and [10]. For the convenience of the reader, we state 
some important results in Section 2 below. 

Statistical inference for augmented GARCH sequences requires the study of the asymp- 
totics of the process. For example, detecting structural breaks in GARCH models via 
CUSUM or MOSUM statistics or determining the limit distribution of the Dickey-Fuller 
statistics used in unit root testing in AR-GARCH processes requires functional CLTs 
for partial sums of various functionals of the process. A possible method to prove such 
results is to use the Markov structure of the model to obtain mixing properties and then 
to employ the theory of mixing variables to deduce the desired asymptotic results. A 
detailed study of /3- mixing properties was undertaken by Carrasco and Chen [11]. Mix- 
ing properties for ordinary GARCH(p, q) sequences were derived by Basrak et al. [3]. An 
overview of different dependence structures occurring in econometric time series mod- 
els can be found in Nze and Doukhan [24]. Besides mixing, they consider, for example, 
NED (near-epoch dependence), association and (tp,£, 0)-weak dependence. An elaborate 
treatment of the latter concept with applications is presented in Dedecker et al. [13]. The 
great advantage of this approach is that once a good dependence condition is obtained, 
many asymptotic results (such as functional CLTs) follow directly from a general the- 
ory, without any additional work. On the other hand, the class of "ready to use" limit 
theorems for dependent sequences is limited and the verification of many well-known 
dependence measures (especially mixing properties) is usually difficult. Hence, to prove 
the availability of a specific dependence structure often requires fairly restrictive moment 
and smoothness conditions on the underlying process. For example, in the case of the 
AR-GARCH model, all existing results on the Dickey-Fuller test, as well as the underly- 
ing functional CLT's, require the existence of four moments of the process. Technically, 
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the existence of moments of yo is a restriction on the functions c(-) and g(-) in (2). Es- 
pecially in higher order GARCH models, it is very difficult to connect the moments of 
yo with the specific model (cf. Ling and McAleer [22]). In this paper, we derive diverse 
independence properties of GARCH and related models and use these results to directly 
reduce their asymptotic behavior to that of independent random variables, extending 
their scope of application and eliminating the restrictive moment and smoothness con- 
ditions required by existing theory. This method, adapted from Bcrkcs and Horvath [5], 
depends on the observation that truncating the infinite series in explicit representations 
for a GARCH process {y n ,n> 1} leads to a perturbed process {y n ,n> 1}, whose finite 
segments {y n , 1 < n < N} arc m-dependent, with m = m(N). This fact leads, via simple 
blocking arguments, to a large class of asymptotic results under optimal conditions. In 
particular, we will derive functional CLT's for powers of augmented GARCH variables, 
requiring only the existence of finite variances (Section 4.1). Further, we prove a strong 
approximation of the empirical process by a two-parameter Gaussian process under log- 
arithmic moment assumptions (Section 4.3). Our method will also yield essentially sharp 
convergence rates to the normal law (Section 4.2). It is very likely that this approach can 
be used in many other situations as well. 

The structure of our paper is as follows. In Section 2, we formulate preliminary results 
concerning the existence of a stationary solution to (2) and conditions for E|j/o| p < oo (p> 
0). In Section 3, we study the dependence structure of augmented GARCH sequences. We 
deduce an approximation of the original random variables by an m-dependent sequence 
and give estimates for the resulting error. These structural results are applied in Section 4 
to derive diverse limit theorems under very mild, or even optimal, assumptions. Finally, 
Section 5 contains proofs. 

2. Preliminaries 

We give conditions that ensure the existence of a strictly stationary solution of the 
stochastic recurrence equation (2). Since we assume (1) and (3), this will obviously pro- 
vide a strictly stationary solution of (4) . The following existence theorem is due to Brandt 
[10]. Let log + .T = log(xVl). 

Theorem 1. Assume that (1) holds and that Elog + |<?(eo)| and K\og + \c(eo)\ are finite. 



If 



Elog|c(e )| <0, 



(5) 



then for every k £ Z, the series 



oo 




(6) 



i=i 



\<j<i 



is convergent with probability one and {Xk,k € Z} is the unique strictly stationary solu- 
tion of (2). 
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If {Xk,kd Z} is a strictly stationary solution of (2), then it is called non-anticipative 
if Xk is independent of <r{£j,j > k). Following Bougerol and Picard [9], we call the model 
(2) irreducible if there is no trivial solution Xk = x£l such that x = c(eo)x + g(so) 
a.s., hence, we exclude the case where g(s ) = x(l — c(e )) for some i£R. The following 
converse to Theorem 1 is a special case of Bougerol and Picard [9], Theorem 2.4. 

Theorem 2. Assume that (1) holds and that the model (2) is irreducible and has a 
strictly stationary non-anticipative solution. Then, the series in (6) converges with prob- 
ability one and is the unique strictly stationary solution of (2). 

The series representation (6) is essential for our approach since it will provide a con- 
struction method for approximating m-dependent random variables. From Theorem 2, 
we learn that whenever a non-anticipative and strictly stationary solution of (2) exists, it 
has the form (6). Furthermore, Theorem 1 provides easy conditions for the convergence of 
this series. Hence, instead of requiring the existence of a solution of (2), we will hereafter 
assume a priori that the series in (6) is convergent and that 

oo 

i=l l<j<i 

We shall abbreviate this compound assumption as simply "(7) holds". Also, note that 
(7) is always a formal solution of (2). 
The next theorem is only a slight modification of a result of Aue et al. [1]. 

Theorem 3. Assume that (1) and (7) hold. If, for some fi > 0, 

E| 5 (£ )| M <oo and E|c(e )|" < 1, (8) 
then E|A((Tq)| m < oo. Conversely, if 

c(e o )>0 and g(e o )>0 (9) 
hold, then (8) is necessary in order that EIA^q)!^ < oo. 

Remark 1. The sufficiency of (8) for EIA^q)!^ < oo is also given in Carrasco and Chen 
[11], but under additional assumptions. They require fi G {1, 2, . . .}, Eeo = 0, Eeq = 1 and 
a continuous density p(-) of Sq such that p(x) > for all x € ffiL 

In [11] and [16], several examples of augmented GARCH(1,1) processes appearing in 
the literature are given. We refer to these articles for an overview. It should be noted 
that most commonly known examples (as well as all examples given in [11, 16]) treat so- 
called polynomial or exponential GARCH processes. We say that an augmented GARCH 
sequence is a polynomial (resp., exponential) GARCH process if A(x) = x s with 5 > 
(resp., A(x) = log a:). This special form of A is motivated from a Box-Cox transformation 
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of the observation. In order to avoid cumbersome calculations for only a small gain, in 
this paper, we shall solely deal with these two types. Our approach will work with more 
general A if we require certain smoothness conditions as, for example, in Aue et al. [1]. 
However, in a generalized formulation, the price will be a loss of accuracy of our results. 
The main advantage of a special choice of A is that we can very precisely transfer different 
properties which are derived for A(cr^) to the volatility process {<rf}. 

The following corollary to Theorem 3 characterizes the existence of E|yo| 2l/ {v > 0) for 
some polynomial GARCH sequence. (See also Ling and McAleer [23].) 

Corollary 1. Assume that (1), (4), (7), (9) and E|eo| 2l/ < oo hold. Assume, further, 
that A(x) = x s (5 > 0). Then, for some v > 0, we have E|yo| 2l/ < oo if and only if 

E|c(£o)r /<5 < 1 and E| 5 (e )r / ' 5 < oo. (10) 

For polynomial GARCH processes, the left-hand side of (2) is always positive and thus 
it is natural to assume (9). Note, however, that c(eq) > and g(eo) > are not required 
for A(ofc) > 0. A non-trivial example is where g(x) = 1 and the distribution of c(eo) is 
concentrated on the interval [—1/2, 0]. It follows from the definition of A that E|yo| 2l/ < oo 
if and only if E\e \ 2l/ < oo and E\A(a%)\"/ s < oo, which, in view of Theorem 3, proves the 
corollary. 

Proposition 1 below gives a moment criterion for exponential GARCH processes. For 
the special case c(sq) = c with |c| < 1, we refer to He et al. [21]. 

Proposition 1. Assume that (1), (4), (7) and E|eo| 2M < oo hold. Assume, further, that 
A(x)=\ogx. If 

\c(e )\<c<l and Ec^ 9(6o)l < oo, (11) 
then E|yo| 2AI < oo. On the other hand, if (9) holds, then 

P(\c(eo)\ < 1) = 1 (12) 

and the second assumption in (11) are necessary to assure E|j/o| 2m < oo. 

The sufficiency part is implicit in [1]. To prove the other direction, we observe that by 
(7) and the non-negativity of c(eo) and g(so), it follows that 

E|yo| 2AI =E|£ | 2 ^Ecxp(M(^o)) > E| £o | 2/i Eexp(/i 9 ( £ o)) (13) 

and hence Ee At9( - £o ^ < oo is necessary. Further, it is clear from (13) that A(ctq) must have 
moments of all orders. If we assume that (12) does not hold, then there is some 5 > 
and some a > such that P{c(eo) > 1 + 5} = a. Consequently, Ec(eo) p > + <5) p > 1 
for sufficiently large p. By (7) and the assumption c(eo),ff(eo) > 0, we have 

oo 

EA(a 2 )? > ^E 5 ( eo )P(Ec( eo ) P r 1 =oo, 

i=l 
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which contradicts the fact that EA((Tq) p < oo for any p > 0. 

3. Dependence structure 

In this section, we show that augmented GARCH sequences (yk) can be closely approxi- 
mated by m-dependent random variables (ykm)- The error of the approximation will be 
measured by the L2-distance 

\\y k - y km h = (nvk - y km \ 2 ) 1/2 

and by probability inequalities for 

P(\Vk-Vkm\ >e)- 

Under the present assumptions, such types of inequalities are relatively easy to obtain, yet 
they have very strong consequences. In Section 4, we show that the closeness of the yk's 
to independence allows the study of asymptotic properties with classical methods and the 
derivation of sharp limit theorems via a unified approach. Since, in many applications, it is 
not the original observations that are analyzed, but the transformed variables % = f(yk), 
where / comprises the functions 

f{x) = \x\ v or f{x)=sign{x)-\x\ v (i/>0), (14) 

we shall derive our results for (rjk). The basic idea for the approximation is this. If (7) 
holds, then 

«^ = A- 1 ( £ g(e k -i) J] <e k -M. 

\l<j<oo l<j<i / 

We now define 

Vkm^Sk^km, (15) 

where a\ m is the solution of 

H4 m )= E 9{ek-i) II < £ k~i)- ( 16 ) 

l<i<m l^i<^ 

It follows that (ykm) defines an m-dependent sequence. Usually, the series in (16) con- 
verges very fast and hence considering the finite sums will only cause a small error. 

Lemma 1. Assume that (1), (4), (7) and (9) hold. Assume, further, that K(x) = x s . Let 
f be given as in (14) and set 

Vk = .f(yk) and T]km = fiUkm)- (17) 
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_// IE 1 770 1 2 < 00, then there are constants C\ > and < g < 1 such that 

\\Vk-Vk m h<C ig m (m>l). 

Lemma 2. Assume that (1), (4), (7) and (11) hold. Assume, further, that A(x) =logx. 
Let rjk and r\km be defined as in (17) and assume that E|?7o| 2 < 00. If pL> v, then there 
is some C2 > such that 

hk-Vk m h<C 2 c m (m>l). (18) 
(Here, fi and v come from (11) and (14) resp.). 

Remark 2. Under the assumption E|?7o| p < 00 for p > 1, some trivial changes in the 
proofs also yield the exponential decrease of the L p -error. 

The conclusion of Lemmas 1 and 2 is that under some minor regularity assumptions - 
providing manageable conditions for the existence of IE 1 770 1 2 (see Corollary 1 and Propo- 
sition l)-the Li approximation error via the perturbed sequence decays exponentially 
fast. An immediate consequence is the short-memory behavior of the sequence (%), which 
can be characterized by the convergence of 



^2 |Cov(77o,?7fc)| <oo. 



fc=i 

In fact, for m = m(k) = k — 1, we get, by the independence of r]o and rjk m , that 
I Cov (»7o,»7k)l = |Cov(t7o,?7a : - r)km)\ < 4||??o||2||% - Vkmh- 

Having established the L p -error of the approximating r.v.'s, one can immediately es- 
timate P(\A(al) — A(o-£ m )| > e) and P(\yk — Ukm \ > e) by using the Markov inequality. 
However, with a little more effort, one can drastically weaken the associated moment 
requirements. Observe that under (7), 



|A(^)-A(ai„)|=nic(e fc -i)||A((7 



2 ) 

k—m ) 



If (5) holds, then we can choose ci > such that — c\ — Elog|c(eo)| > 0. From the sta- 
tionarity of A(cr 2 ), we get 

P^ni c (^-i)ll A (^-m)l>e- cim/2 ^ 

< P (j2( lo S \c(e k -i)\ - Elog |c(e )|) > (-ci - Elog \c(e )\)mj + P(\A(a 2 )\ > e^ m ' 2 ). 

Of course, the first term in the previous line only makes sense if Elog |c(eo)| > —00. These 
simple observations are summarized in the next lemma. 
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Lemma 3. Assume that (1) and (7) hold. Assume, further, that 

-oo<Elog|c(£ )| <0 (19) 

and set 

m 

T ro = 53(log|c( ei )|-Elog| C ( e ,)|). 
1=1 

There are then constants C 3 ,C^ > which do not depend on m such that for sufficiently 
small a > 0, we have 

P(\A(a 2 k ) - A(aL)| > *~ am ) < P(T n > C 3 m) + F(|A(a 2 )| > e^ m ). 

Lemma 3 shows that the error rate depends on the tail probabilities of T m and | A(<Tq)| . 
Both can be obtained by classical methods for independent random variables. Clearly, 
these probabilities are determined by the specific distributions of c(eo) and g(so)- The 
next two lemmas give more concrete estimates under different moment assumptions. 

Lemma 4. Assume that the conditions of Lemma 3 hold. If for some fi>2, 

E|log|c(eo)||"<oo and E(log+ \g(e W < oo, (20) 
then for a sufficiently small a > ; there is some constant C5 such that for every m > 1, 

P(\A(al) A(atJ\ > e~<* m ) < C b m^/\ 
Lemma 5. Assume that the conditions of Lemma 3 hold. If for some fi > 0, 

E|c(e )| AI <l and E|.g(e )| At < 00, (21) 
then for a sufficiently small a > 0, there are constants Ce, p > such that for every m > 1, 
P{\K{al) AK 2 JI > c- m ") < C 6 e-» m . 

Finally, we consider the estimation of P(\yk — Vkm\ > e )- As in Lemmas 1-2, we will 
be a little bit more general, considering the transformed variables 77^ and distinguisting 
between polynomial and exponential GARCH. 

Lemma 6. Assume that (1), (4), (7), (9) and (19) hold and that (20) holds with some 
fx > 2. Assume, further, that A(x) — x s and let r\k and rjkm be given as in (17). Then, 
for sufficiently small a > 0, there is some constant C-j such that 

P(\Vk ~ Vkm\ > e- am ) < P(\e \ > e am ) + C 7 m^/ 2 (m > 1). 



Augmented GARCH sequences: Dependence structure and asymptotics 



551 



Lemma 7. Assume that (1), (4), (7) hold and that (21) holds with some [i > 0. Assume, 
further, that A(x) =logx and let rjk and % OT be given as in (17). Then, for sufficiently 
small a > 0, there is some constant C% such that 

P{\m ~ Vkm\ > c- am ) < P(\e Q \ > e am ) + C 8 m-» (m > 1). 

4. Applications 

4.1. The functional central limit theorem 

Statistical inference based on GARCH sequences often requires the establishment of 
functional central limit theorems for partial sums processes like 

s„(t) = 4= E (f(yi)- E f(y°)) 

v l<i<nt 

or 

Z ^ = \ £ tt{°i)-Ef{o-o)). 

* l<i<nt 

For example, to derive the asymptotic distribution of the CUSUM or MOSUM statistics 
applied in the theory of change-point detection, an FCLT is needed. Also, the determi- 
nation of the asymptotic distribution of the Dickey-Fuller statistic for the unit root test 
in an AR-GARCH model requires an FCLT. Several authors have obtained functional 
limit theorems for S n (t) with f(x) = x, f(x) — \x\ or f(x) = x 2 under various conditions. 
Most FCLTs in the literature assume at least four moments of the GARCH variables 
(cf. Davidson [12], Hansen [19]) instead of the more desirable assumption of solely a 
finite variance. Berkes et al. [6] noted that in the special case f(x) = x and Eeo = 0, the 
envisaged result follows from an FCLT for crgodic martingale difference sequences (cf. 
Billingslcy [7], Theorem 23.1) under the optimal condition Ei/q < oo. However, if Eyo 7^ 
or if we are, for example, interested in the squared GARCH sequence, then the martin- 
gale structure no longer applies. Giraitis et al. [18] point out the important association 
property of the ARCH(oo) model, which, when specialized to the GARCH case, yields 
an FCLT for the sequence y\ under Ey| < oo. Since association is preserved by non- 
decreasing transformations, an FCLT for the sequences (|j/fe| 1 ') follows if one shows that 
Sfc-oo Covflj/ol", \Vk\ v ) < oo- Although the convergence of the last series is implied by 
Lemmas 1 and 2, this approach will not lead to the desired goal here. Due to the general 
form of the functions c(x) and g{x) in (2), it is not clear whether the association property 
of the independent e^'s is inherited as it is in the ARCH(oo) case. Also, for the vari- 
ables sign(yfc) • |j/fe|", this method seems not to be applicable. A further weak dependence 
condition has been proposed by Doukhan and Louhichi [14] (see also Doukhan and Win- 
tenberger [15]), the so-called (6, £, i/>)-weak dependence. Roughly speaking, a sequence is 
(0, £, ^-dependent if for all n > 1 and all bounded Lipschitz functions /i,fc:R"->R, the 



552 



S. Hormann 



covariances Cov(/i('past'), /c('future')) < ip{-)0{-). Here, ip and 9 are functions, depending 
on the choice of 'past' and 'future'. If the covariance relation holds with proper ijj and 8, 
this structure can be used to verify an FCLT (see Nze and Doukhan [24], Section 5.1.6). 
Berkes et al. [6], Theorem 2.9, applied this method to derive the FCLT for the squares 
of some GARCH sequence under the condition E|yo| K < oo for some k > 8. 

Here, we will obtain the FCLT for the partial sum processes S n (t) (resp., Z n (t)) by as- 
suming only the necessary assumption Ef(yo) 2 < oo. Let denote weak convergence 
of a random function in the Skorokhod space T>[0, 1]. 



Theorem 4. Assume that (1), (4), (7) and (9) hold. Assume, further, that A(x) = x s 
(5 > ) and that f is given as in (14). If E/(?/o) 2 < oo, then 

T 2 =Var/(2/o) + 2 £ Cov(/(y ),/(y fe )) (22) 

l<k<oo 

is convergent and 

S n (t) V -^ ] TW(t), 

where {W(t),0 < t < 1} is a Brownian motion. An analogous result is valid for Z n (t). 
In particular, when the yk are GARCH(1, 1) variables, we obtain FCLTs 



or 



nt 

^-^l-^l)^ ] r,W{t) (23) 

V k=l 



(here, t^,t| are the corresponding variances arising from (22)) under the necessary con- 
l < oo and Ecr£ 



ditions Eyf. < oo and E<r| < oo, respectively 



Theorem 5. Assume that (1), (4), (7) and (11) hold. Assume, further, that A(x) =logx 
and let f be given as in (14). 7/E/(?/o) 2 < oo and [i> v, then the proposition of Theorem 4 
holds. (Here, /i and v stem from (11) and (14).,) 

Proof of Theorems 4 and 5. Set = r/fc — £770 and ^fc m = 7yfc rn — E^o, where rjk and 
r7fc m are given as in (17). Obviously, £k = <?(• • • ,£fc-i,£fc)j where g is some measurable 
mapping from the space of infinite sequences into R. According to Theorem 21.1 of 
Billingsley [7] , for the proof, it is enough to find measurable mappings g m from W m into 
R such that 

]T ||£o-Com||2<oc, (24) 

1 < rn < 00 
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where Com = 3m(e-m+i, £-m+2, ■ ■ ■ >£o)- By Lemma 1, £om satisfies this requirement and 
thus setting Com = Com yields the proof of Theorem 4. The proof of Theorem 5 is the 
same. □ 



4.2. Rates of convergence in the central limit theorem 

The functional CLT developed in Section 4.1 clearly implies the ordinary CLT for the 
partial sums and raises the question of the normal approximation error. If / is given as 
in (14), we will obtain the rate of convergence of 

s n = f(yx) + --- + f(yn)-nMf( yi ) 

to the normal distribution provided that 7713 = E|/(yi)| 3 < 00. For notational convenience, 
we will write S n instead of S n (f) and we set = B n (f) 2 = V&rS n (f) and 

n-l 

Pi = PlU) = Var/(y ) + 2^(1 - j/n)Cov(/( yi ), f(y j+1 )), 

i=i 

that is, 0^ = B^/n. 

Theorem 6. Assume that the conditions of Theorem 4 hold with the added assumption 
E|/(y )| 3 <oo. Then, 

/3:=lim/3„ exists (25) 
and if ft > 0, there is some C > such that 

\P{S n < xB n } - $(x)| < C ^° g ^} for alln>2 and x £ M. 

The constant C may depend on f, A, c, g and the law of £q. 

Theorem 7. Assume that the conditions of Theorem 5 hold with the added assumptions 
E|/(yo)| 3 < 00 and /1 > 3^/2. Then, the proposition of Theorem 6 holds. 



The additional assumptions in Theorems 6-7, compared with Theorems 4-5, arise 
from the requirement E|/(yi)| 3 < 00, which is the classical assumption in the context 
of Berry-Esseen bounds. The existence of the limit in (25) follows from Theorems 4- 
5. The rate (logn) 2 n -1 / 2 coincides with that obtained by Tihomirov [27], Theorem 2, 
for sequences which are /^-mixing with geometric rate. The proofs of Theorems 6-7 are 
given in Section 5.2. The main ingredients are a Berry-Esseen bound for m-dependent 
sequences (Tihomirov [27], Theorem 5) and Lemmas 1-2. Again, it becomes clear that 
m-dependence, rather than mixing, is the crucial structural property required in order 
to study the asymptotics of augmented GARCH variables. 
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4.3. Asymptotics of the empirical process 

In this section, we study the empirical process of augmented GARCH sequences. For this 
purpose, we define 

R(s,t)= ]T (I{F(y k )<s}-s), 

l<k<t 

where F(x) = P{yo < x). We will derive an almost sure approximation theorem for i?(s, t) 
by a two-parameter Gaussian process K(s,t), assuming only the existence of logarithmic 
moments of the e k and c(e k ), g(s k )- In the sequel, we shall use Yfe(s) = I{F(y k ) < s} — s. 
Our results rely on the following recent theorem, which is a special case of Theorem 2 in 
Berkes et al. [4]. We write a n <C b n if limsup^^^ \a n /b n \ < oo. A function F is Lipschitz 
continuous of order 8 if there is some constant k such \F(y) — F(x) \ < n\y — x\ e for all 
x,y G R. 

Theorem 8. Assume that {y k ,k € Z} is a strictly stationary sequence which can be 
represented in the form 

Vk = /(• • .,ek-i,£fc), 

where f : R N — > R is Borel measurable and {e k ,k E "E} is an i.i.d. sequence. Further, 
assume that the distribution function F(x) = P(yo < x) is Lipschitz continuous of order 
6. If there are measurable functions f m : R m — > R (m>l) such that 

P{\Vk - f m (e k - m +i, ■ ■ ■ ,£k)\ > m~ A ) < V min{A/0, B} > 4, 

£/ien i/ie series 

r(s,s')= ]T EYo(s)y fc (s') (26) 

— oo<A;<oo 

converges absolutely for every choice of parameters < s,s' < 1. Moreover, we can con- 
struct {yk,k € Z} and a two-parameter Gaussian process {K(s,t),(s,t) £ [0,1] 2 }, loii/i 
EA'(s, i) = and EA'(s, i)K(s', t') = (t A £')F(s, s'), on one probability space such that for 
some a > 0, 

sup sup |i?(s,t)- A(s,t)| = o(T 1/2 (logT)- Q ) a.s. 

0<KT0<s<l 

A look at our approximation results in Section 3 (Lemmas 6 and 7) shows that Theo- 
rem 8 can be almost directly applied for augmented GARCH sequences. The only thing 
that remains to be checked is the Lipschitz continuity of F. This will follow if, for exam- 
ple, we demand that H(x) = P(eo < x) is Lipschitz continuous of order 9 and that 

Eo-(7 e < oo. (27) 
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Indeed, since Eq and do are independent, we get 

\P(yo < x 2 ) - P{yo < xi)\ = \P(e < x 2 /a Q ) - P(e < xi/tr )\ 

<E\H(x 2 /a )-H(x 1 /ao)\ 
< LE<Tq 6 \x 2 — xi\ e . 

A sufficient condition for (27) is <To > 5 > 0, which is satisfied in all known examples for 
polynomial GARCH processes. In case of exponential GARCH, the conditions 

\c(e Q )\<c<l and Ecxp(0/2|g(e o )|) < oo 

suffice to ensure (27). To see this, note that 

Ecr/ <Ecxp((9/2|A(o- 2 )|). 

Hence, we use the same arguments as in the proof of Proposition 1 to show that the last 
term is finite. The following theorems summarize our observations. 

Theorem 9. Let A(x) = x s , 5 > 0. Assume that (1), (4) and (7) hold. Assume, further, 
that the distribution function H(x) = P(so < x) is Lipschitz continuous of order > 
and that (27) holds. If (20) holds and Elog + |e |^ _2 ^ 2 < oo with some fi > 10, then the 
proposition of Theorem 8 is valid. 

Theorem 10. Let A(x) =logx. Assume that (1), (4) and (7) hold. Assume, further, 
that the distribution function H(x) = P(so < x) is Lipschitz continuous of order 9 > 
and that (27) holds. If for some fi > 10, (21) and Elog + |eo|^~ 2 ^ 2 < oo hold, then the 
proposition of Theorem 8 is valid. 

An immediate consequence is the following two-parameter version of the empirical 
central limit theorem which can, for example, be used to detect a change in the structure 
of the GARCH sequence (cf. Bai [2]). 

Theorem 11. Assume that {yk, k £ Z} is an augmented GARCH sequence satisfying the 
conditions of either Theorem 9 or Theorem 10. Let {K(s, t), (s, t) £ [0, l] 2 } be a Gaussian 
process with EK(s,t) = and EK(s,t)K(s' ,t') = (t A t')T(s, s'). Then, 

nXl \~ E (I{F(y k )<s}-s)Y^ 2 K(s,t) (n-oo). 

V l<k<nt / 

As we pointed out in the introduction, our approach is, in many cases, superior to 
results following from mixing properties. Carrasco and Chen [11] verified /?-mixing with 
exponential decay for y^, an approach requiring a continuous positive density of Sq on 
(—oo, oo) and Ee = 0, Ee 2 , = 1 and 

|c(0)|<l, E|c(e )|<l, %(e )|<oo. (28) 
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Together with Theorem 2 in Philipp and Pinzur [26], this yields a similar result. Clearly, 
(28) is more restrictive than (20). Also, our results do not require a positive and con- 
tinuous density. In the literature, special attention has been paid to the IGARCH(1,1) 
process, that is, GARCH(1,1) with E(/3 + ael) = 1. For an IGARCH process, (28) does 
not hold, but our new results still apply. The moment conditions (28) for c(sq) and g(so) 
are slightly milder than those of Theorem 10 in case of exponential GARCH sequences. 

5. Proofs 

5.1. Perturbation error 

Proof of Lemma 1. We assume, without loss of generality, that k = 0. By our assump- 
tion (9), the term A(cr| m ) is non-negative and thus, using the special choice of / and A, 
we get 



Let us first consider the case v/6 < 1. From (7) and Minkowski's inequality (Hardy, 
Littlewood and Polya [20]), we infer that 



\Vo-Voml 2 = koHCA- 1 o A(<t 2 ))" /2 - (A- 1 o A(a 2 0m )f /2 \ 2 
< leoHCA- 1 o A( CTo 2 )r - (A- 1 o A(a 2 m )H 
= |e | 2 HA(a 2 )^-A(a 2 m )^|. 



(29) 
(30) 



|AK 2 )^-AK 2 m )^|< £ g{e-i) J] c(e- 3 ) 




OO 



Since we assume IE 1 770 1 2 < 00, it follows from Corollary 1 that 



i— m+1 l<j<* 



with some constant ci > and Qi = ILc(sq) u / s < 1. 

If v/S > 1, then by the mean value theorem, (30) is bounded by 



ko| 2 ^|AK 2 )|^- 1 |AK 2 )-A(a 2 m )|. 



From this, we get, by the Holder and the Minkowski inequalities, that 



E|?7o - ??om| 2 
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-3)) 



V, 



i=m+l \ l<j<i ) 



□ 



Proof of Lemma 2. Assume, again, that k = 0. We can formally derive (29), which 
reads here as 

\Vo -Vom\ 2 < \e \ 2v \e» l °z° 2 °-e vl °z° 2 °™\. (31) 
By the mean value theorem, we get 

je^iog^ 2 _ e "i°g<rg m | < ^logojj +e lylogCT »™)|logcr 2 -loga 2 m |. (32) 

Since v < \i, we can find some £ > such that + C) < M- It follows from Proposition 1 
that 

Ee Ki+C)ioga= <OQ 

and, similarly, we get 

Ee ^(i+C)i°g^ m <oo. 
Thus, the Holder inequality and the Minkowski inequality give 

Ee^°|log<7 2 -logcr 2 J 

< (Ee^ 1+0 lo ^«) 1/(1+0 (E| log a 2 - log^J (1+C)/C ) f/(1+f) 

OO 

< (Ee^ 1+olo ^o) 1/(1+c) (E|.g(eo)| (1+<:)/C ) C/(1+C) £ c 4 " 1 < const • c m . 

i— m+1 

The analogous result can be obtained if we replace exp (i/logu 2 ) by exp (z/logOgm)- Re- 
lation (18) now follows from (31) and (32). □ 

Proof of Lemma 4. We can make use of Lemma 3. By the first part of (20) and the 
Rosenthal inequality (see, e.g., Petrov [25], Theorem 2.9), we infer that P(T m > C^m) < 
cimT^/ 2 . Hence, it remains to estimate P(|A(og)| > c Cim ). We set 71 =Elog|c(eo)| an d 
72 = Elog + |<?(eo)|. Let < g < 1 such that logg — 71 = a > 0. If t is chosen sufficiently 
large, we get, from (7) and the Markov inequality, 



P(\A(a 2 )\>t) 
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<E P IfMI II \c(e- j )\>t(l- Q ) Q i - 1 



1=1 

oo 



l<j<i 



J2P log + | 5 (e- t )l+ J2 (log|c(e-i)|-7i)>Iog*-Pa + i(loge- 



7i J 



oo 



l<j<i 



<^P log+| 5 ( £ _ i )|- 7 2+ £ (logical -7i)> 



i<i<* 



< 



log + |.9(e- t )|-72+ £ ( lo gl c ( £ -j)|-7i) 

l<j<t 



log* 



log* 



From (20), and again from the Rosenthal inequality, we obtain 

oo 

P(|A(a 2 )|>i)<c 3 ]>> 



W2 /logi . 



i=l 



1=1 ^ ' 



□ 



Since Lemmas 5-7 follow by similar arguments, we will omit their proofs. 
5.2. A Berry-Esseen bound 

We shall prove Theorem 6; the arguments for Theorem 7 are identical. Our proof relies 
on the following result for m-dependent sequences. 

Lemma 8 (Tihomirov [27]). Let X\, X2, ■ ■ ■ be a strictly stationary sequence of in- 
dependent random variables with EJfi = such that 

E|Xi| 3 <oo. 

Let Bl = Var(S"„) and f3 2 = EXf + 2J2T=2 Ex i x k- V P 2 > 0, then there exist absolute 
constants C\ and C2 such that 

sup \P{S„ < B n x} - $ (x)\ < Ci m ' + C 2 -35 — ! — , 

where b rn = maxi< p < m+ i E 1/3 | J2l=i x v\ 3 - 
We use the notation of Section 4.3 and additionally define rj^ and r\hm by 
Vk = f{yk) - E/(j/fe) and i] km = f(e k a km ) -Ef(e k a km ). 
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Then, clearly, for every m > 1, {rjkrmk € Z} defines some strictly stationary and m- 
depcndcnt sequence. By a routine argument, we can extend the proof of Lemma 1 to 
show 

\\Vk — % TO ||2 < cig m for all m > 1 and k € Z, (33) 

for some positive constant ci and g> < 1. We choose some n € N which we assume to 
be fixed for the moment and set S' n = X)fe=i Vkm and {B' n ) 2 = YaxS' n with m = [tlogn]. 
The value of t will be specified later. Let (8k) be a sequence of positive reals. A simple 
estimate gives 

P{S n > xB n } < P{S' n > (x - 8 n )B n } + P{\S n - S' n \ > 8 n B n } 

and 

P{S' n > (x + 8 n )B n } < P{S„ > xB n } + P{\S n - S' n \ > 8 n B„}. 

A repeated application of the triangle inequality together with the latter estimates, 
shows, after a moment's reflection, that 

sup \P{S n < B n x] - $(x)\ < HP + + i?i 3) , (34) 

where 

RM = sup \P{S' n < B n (x + 8 n )} - $(x + 8 n )\, 

rv> = 8up|$(x + <y n ) - 

Ri 3 ^P{\S n -S' n \>8 n B n }. 

We shall now estimate Rn\ i = 1,2,3. Setting 8„ = n -1 / 2 , we get, by the mean value 
theorem, Rn < (27tn)~ 1 / 2 . From (25), we conclude that 2B 2 > f3 2 n for all n > n Q . Since 
we assume that /3 2 > 0, we infer from the Markov and the Minkowski inequalities and 
(33) that 

R^ = P{\S n - S;| > 8 n B n } < nE\S n - S' n \ 2 / B 2 n 
< n f ^ Hz/a - 2/ fc ™||2^ / S 2 < c 2 n 2 g m , 

where c 2 can be chosen such that it is not dependent on n. (The following constants Ci 
occurring in the proof are also independent of n.) Hence, if t > — 5/logp, we have Rn = 
0(?i -1 / 2 ). In order to estimate R n , we will use Lemma 8. This cannot be directly applied; 
we have to change from B n to B' n . By Cauchy-Schwarz and Minkowski's inequality again, 



560 



S. Hormann 



we derive 

\B 2 n - (B' n ) 2 \ < E\S n - S' n \\S n + S'J < c 3 \\S n - S'J 2 B n 

n 

< c 4 n 1/2 Y,hk- Vkmh < c 5 " 3/ V" /2 < cen- 1 , 
fe=i 

where the last inequality follows from the choice t > —5/ log g. Again using 2£? 2 > /3 2 n, 
if n > uq , we can reformulate the latter estimate to 

(1 + c 7 n- 2 )- 1/2 < |? < (1 - c 7 n- 2 )-^ 2 , (35) 

provided that c^n < 1. By routine arguments (cf. [25], Lemma 5.2), it follows that 

3 3 i*w-*(»)i<{g-/j/^;>, u-< P <i. <*> 

Trivially, we have 

R ( n ] < \P{S' n <B n (x + S n )}-^(B n /B' n (x + S n ))\ 
+ mB n /B' n (x + 5 n ))-$(x + 5 n )\ 

From (35) and (36), we get R\} 2) = 0{n- 2 ). Next, observe that E|r?i m | 3 < c 8 E\f( yi )\ 3 
and that as consequence of (35), we have (3' n ~ /3, where 



/ [tlogn] \ 

0'n = Varr7i m + 2 ^] Cov (r/i m ,^ m ) 



1/2 



By Lemma 8, it follows that .R^ = 0(n 1 / 2 log 2 ?i). Finally, combining our estimates 
for Rn X \ Rn 2 \ Rn^ and Rn^ with (34) completes the proof. 
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